1. INTRODUCTION

The power industry is increasingly concerned with power quality (PQ) issues because of customer demand.
Typical PQ disturbances are voltage sags, transients, harmonics, and voltage swells. These disturbances can
cause economic losses for industry, for example by degrading the quality of its products [1]. Much modern
equipment uses power electronics and is more sensitive to such disturbances, so the resulting damage is
costly [2]. Power quality refers to how close the voltage waveform is to a sinusoid at rated magnitude and
frequency [3]. Voltage sags and swells are the most common and most widely discussed types of PQ
disturbance [4]. Most of the disturbances discussed occur on the phase connection. However, some studies
show that such disturbances can also occur on the neutral-to-ground (NG) connection [5-8]. Several factors
cause high NG voltage. One is phase current imbalance caused by a large number of single-phase loads
connected to one phase elsewhere in the system [9]. The ground connection also plays a large part: a poor
earth/ground connection at the local ground point or at the substation can affect the NG voltage.

Given these problems, the types of high NG voltage should be classified so that corrective work can be
carried out in the right place. Here, data preprocessing is used to obtain a better model for classifying
the NG voltage. A number of methods are used for preprocessing voltage signals. Visual inspection was used
in the past, but it is impractical now that voltage signals involve large amounts of data [10]. It also
requires someone with good knowledge of how to extract the desired information from the voltage signals.
More modern methods reported in past studies include the fractal-based method, the S-transform (ST), the
Fast Fourier Transform (FFT), the Short Time Fourier Transform (STFT) and the Discrete Wavelet Transform
(DWT) [11-14].

In this paper, the wavelet transform (WT) is proposed for data preprocessing and an artificial neural
network (ANN) is proposed for classification of the NG voltage. The WT can apply windows of various
lengths according to the signal frequencies [15], and it can analyze PQ disturbances simultaneously in
both the time and frequency domains [16-18]. For classification of PQ disturbances, the ANN has been widely
used in past studies. An ANN can classify noisy data within large data sets [16]. It can implicitly detect
complex nonlinear relationships between dependent and independent variables, can detect all possible
interactions between predictor variables, and offers multiple training algorithms. An ANN model based on
the Levenberg-Marquardt algorithm, as proposed in [19], was used to classify the high NG voltage
components. The Levenberg-Marquardt neural network (LMNN) was chosen because of its better accuracy, as
reported in [20].


2. RESEARCH METHOD 
2.1. Data Preparation 

Two types of high NG voltage component are analyzed and classified in this study. The first is due to
harmonics, which originate from nonlinear loads. The second is a transient, which is due to load starting
or a fault. In this paper, 9 events of high NG voltage were recorded using a Fluke analyzer at 9 different
places. Figure 1 shows all 9 NG voltage signals.

Figure 1. All 9 NG voltage signals


To obtain the best classification result, the features to extract must be chosen correctly. In this
paper, the WT based on the Discrete Wavelet Transform (DWT) is used to extract the information.
The DWT provides sufficient information with little computation time. It can be expressed as:


$$\mathrm{DWT}_{\psi}\,x(j,k) = \int x(t)\,\psi_{j,k}(t)\,dt \qquad (1)$$

where $j$ and $k$ are the integer scale and shift parameters, respectively, and $\psi_{j,k}(t)$ is the
mother wavelet.

The choice of mother wavelet plays a significant role where higher-scale signal decomposition is
needed [3]. To decompose the NG signal, the Daubechies (db) wavelet is used. From the db wavelet family,
the fourth-order Daubechies wavelet (Daub4) is chosen in this paper because it is the simplest one and is
similar to the Haar transform. Daub4 is the most localized, i.e. compactly supported, in time [21]. From
the Daub4 wavelet transform, the mean absolute deviation (MAD) and standard deviation (SD) at level 4 are
chosen as the inputs for the two types of high NG voltage component. MAD is the average absolute deviation
of a data set from a central point, and SD is a measure of how spread out a distribution is. In other
words, both are measures of variability.
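
The feature extraction described above can be sketched in a few lines. This is an illustration in Python, not the MATLAB Wavelet Toolbox procedure used in the paper. It assumes that "Daub4" means the four-coefficient Daubechies filter, that the signal is extended periodically at its edges, that MAD is taken about the mean, and that SD is the sample standard deviation. The function names and the synthetic test signal are hypothetical.

```python
import math
import statistics

# Four-tap Daubechies low-pass filter (assumed to be the paper's "Daub4").
_S3 = math.sqrt(3.0)
_NORM = 4.0 * math.sqrt(2.0)
LOW = [(1 + _S3) / _NORM, (3 + _S3) / _NORM, (3 - _S3) / _NORM, (1 - _S3) / _NORM]
# Matching high-pass filter (quadrature mirror of LOW).
HIGH = [LOW[3], -LOW[2], LOW[1], -LOW[0]]


def dwt_step(signal):
    """One decomposition level with periodic extension: (approximation, detail)."""
    n = len(signal)
    if n < 2 or n % 2:
        raise ValueError("signal length must be even")
    approx, detail = [], []
    for i in range(0, n, 2):
        window = [signal[(i + k) % n] for k in range(4)]
        approx.append(sum(h * x for h, x in zip(LOW, window)))
        detail.append(sum(g * x for g, x in zip(HIGH, window)))
    return approx, detail


def detail_at_level(signal, level):
    """Detail coefficients at `level`, found by repeatedly splitting the approximation."""
    approx, detail = list(signal), []
    for _ in range(level):
        approx, detail = dwt_step(approx)
    return detail


def mean_abs_deviation(values):
    """Average absolute deviation from the mean (assumed central point)."""
    centre = statistics.fmean(values)
    return statistics.fmean([abs(v - centre) for v in values])


def extract_features(signal, level=4):
    """Return (MAD, SD) of the detail coefficients at the given level."""
    d = detail_at_level(signal, level)
    return mean_abs_deviation(d), statistics.stdev(d)  # sample SD (n - 1)


# Synthetic stand-in for a recorded NG voltage: 50 Hz plus a third harmonic,
# sampled at an assumed 3200 Hz. These are not measured data.
demo = [
    math.sin(2 * math.pi * 50 * t / 3200) + 0.3 * math.sin(2 * math.pi * 150 * t / 3200)
    for t in range(256)
]
mad_value, sd_value = extract_features(demo)
```

Each recorded signal would yield one (MAD, SD) pair in this way, which then serves as one input sample for the classifier.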


2.2. Classification Stages 

The LMNN is a simple technique that nevertheless gives highly accurate results. It is a powerful tool
that is frequently used in many applications for solving generic curve-fitting problems. In this case, the
LMNN is therefore used to classify the types of neutral-to-earth voltage (NTEV) rise, i.e. high NG voltage,
in the electrical system.

To obtain the best classifier performance, the training parameters of the LMNN, namely the learning
rate, momentum rate and number of hidden-layer nodes, must be optimized first. Figure 2 shows the inputs
and outputs for classification of high NG voltage.


[Figure 2 content. Inputs: MAD and SD of high NG voltage due to harmonic; MAD and SD of high NG voltage due to transient. Outputs: high NG voltage due to harmonic; high NG voltage due to transient.]
Figure 2. The inputs and outputs for classification of high NG voltage

Figure 3. The flow chart for classification of high NG voltage

Figure 3 shows the seven steps needed to classify high NG voltage. The first step converts the data
from the wavelet transform into a MATLAB file and imports it into MATLAB. The data values must be small or
within a specified range. This leads to the second step, data normalization, which makes the values more
appropriate as inputs to the ANN. Training and testing are the most important processes for an ANN.
In the third step, the data are divided into two groups: 70% are used for training and the other 30% for
testing. For better classification results, the training data are not used in the testing process. The
data are separated into train input, train target, test input and test target using a MATLAB script.
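
Steps 2 and 3 can be sketched as follows. The paper does not state which normalization is used or whether the data are shuffled before splitting, so min-max scaling to [0, 1] and a seeded shuffle are assumptions here. The function names and the 54 placeholder rows are hypothetical and stand in for the real (MAD, SD, target) data.

```python
import random


def min_max_normalize(column):
    """Scale one feature column linearly into [0, 1] (assumed normalization)."""
    lo, hi = min(column), max(column)
    if hi == lo:
        return [0.0 for _ in column]
    return [(v - lo) / (hi - lo) for v in column]


def split_70_30(samples, seed=0):
    """Shuffle, then split into disjoint training (70%) and testing (30%) sets."""
    shuffled = list(samples)
    random.Random(seed).shuffle(shuffled)
    n_train = round(0.7 * len(shuffled))
    return shuffled[:n_train], shuffled[n_train:]


# 54 placeholder (MAD, SD, target) rows; the numbers are synthetic, not measured.
rows = [(0.1 * i, 0.2 * i, 1 if i < 45 else 2) for i in range(54)]
mad_col = min_max_normalize([r[0] for r in rows])
sd_col = min_max_normalize([r[1] for r in rows])
samples = [(m, s, r[2]) for m, s, r in zip(mad_col, sd_col, rows)]

train, test = split_70_30(samples)
train_input = [(m, s) for m, s, _ in train]
train_target = [t for _, _, t in train]
test_input = [(m, s) for m, s, _ in test]
test_target = [t for _, _, t in test]
```

With 54 samples the split gives 38 training and 16 testing samples, and no sample appears in both sets.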

In this paper, the model uses 54 data points: 45 from high NG voltage due to harmonics and 9 from high
NG voltage due to transients. With the 70:30 ratio, 38 data points are used for training and 16 for
testing. The training parameters are optimized in step 4. The learning rate and momentum rate lie in the
range 0 to 1 and are varied from 0.1 to 1 in increments of 0.1. For learning rate optimization, the
momentum rate is set to its maximum value of 1 and the number of hidden-layer nodes is set to 10.
The mean square error (mse) is compared for each value from 0.1 to 1, and the value with the lowest mse is
chosen as the optimal learning rate. The optimal learning rate is then used to determine the momentum rate
in the same way. The optimal number of hidden-layer nodes is then determined using the optimal learning
rate and momentum rate. The error goal is set to 0.01 and the number of epochs to 100. All of these
parameters are used for training and testing. The optimized parameters are shown in Table 1.
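
The one-parameter-at-a-time search in step 4 can be written as a short sketch. `evaluate_mse` is a hypothetical callback that would train the network with the given settings and return its mse. The candidate range for the hidden layer (1 to 20 nodes) and keeping the hidden layer at 10 nodes during the momentum sweep are assumptions, since the paper does not state them. The toy objective only stands in for real training.

```python
def grid(start=0.1, stop=1.0, step=0.1):
    """Candidate rates 0.1, 0.2, ..., 1.0, rounded to avoid float drift."""
    count = round((stop - start) / step) + 1
    return [round(start + i * step, 10) for i in range(count)]


def tune_sequentially(evaluate_mse, hidden_sizes=range(1, 21)):
    """Pick learning rate, then momentum rate, then hidden nodes by lowest mse."""
    rates = grid()
    # Learning rate sweep: momentum fixed at 1, hidden layer fixed at 10 nodes.
    best_lr = min(rates, key=lambda lr: evaluate_mse(lr, 1.0, 10))
    # Momentum sweep with the chosen learning rate (hidden layer assumed still 10).
    best_mr = min(rates, key=lambda mr: evaluate_mse(best_lr, mr, 10))
    # Hidden-layer sweep with both chosen rates (candidate range is an assumption).
    best_hl = min(hidden_sizes, key=lambda hl: evaluate_mse(best_lr, best_mr, hl))
    return best_lr, best_mr, best_hl


def toy_mse(lr, mr, hl):
    """Synthetic stand-in for 'train the LMNN and report its mse'."""
    return (lr - 0.3) ** 2 + (mr - 0.7) ** 2 + 0.01 * (hl - 9) ** 2


best = tune_sequentially(toy_mse)
```

This search evaluates each parameter along a single line rather than over the full grid, so it is cheap but can miss combinations in which the parameters interact.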


Table 1. The Optimized LMNN Parameters


Parameters                              Optimal Value
Nodes in input layer                    4
Nodes in output layer                   3
Number of nodes in hidden layer (hl)    9
Learning rate (lr)                      0.1
Momentum rate (mr)                      0.2
Epochs                                  100
Error goal                              0.01


Step 5 is the decision process of the LMNN. To proceed to the next step, the system needs to converge
during the training stage. The system returns to the previous step if the convergence and regression value
do not meet the given criteria. Step 6 is the classification stage for training and testing. All high NG
voltage data for both training and testing are separated into their own group, either high NG voltage due
to harmonics or high NG voltage due to transients. To separate these two types of output, a threshold is
set in the range 1 to 2; the threshold used in this LMNN model is 1.5. This threshold determines which
group a high NG voltage belongs to. In the final step, the prediction and classification results are
displayed in a graph.
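
The thresholding in step 6 amounts to a single comparison. The sketch below assumes that group 1 denotes high NG voltage due to harmonics and group 2 high NG voltage due to transients, and that an output exactly at the threshold is assigned to group 2. Neither detail is stated in the paper, and the function name is hypothetical.

```python
THRESHOLD = 1.5

# Assumed mapping between group numbers and disturbance types.
GROUP_LABELS = {
    1: "high NG voltage due to harmonic",
    2: "high NG voltage due to transient",
}


def classify(output, threshold=THRESHOLD):
    """Assign a network output to group 1 (below threshold) or group 2."""
    return 1 if output < threshold else 2


# Two example network outputs, one near each target value.
labels = [GROUP_LABELS[classify(y)] for y in (0.97644, 1.98476)]
```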


3. RESULTS AND ANALYSIS 

Using MATLAB R2011a, the high NG voltage signals were analysed with the Wavelet Toolbox.
The signal representing the data was sampled over a certain period of time before being analysed with the
One-Dimensional Discrete Wavelet Analysis tool. In the toolbox, the fourth-order Daubechies wavelet (Daub4)
was chosen and the data were decomposed from level 1 to level 5. This analysis was repeated carefully to
obtain the best features from the signal for use as inputs to the ANN. The analysis shows that the Daub4
decomposition at level 4 has significant values for all data. The detail decomposition at level 4 was
therefore chosen for statistical analysis, and the mean absolute deviation (MAD) and standard deviation
(SD) values are used as the inputs to the ANN model.

Figure 4 (a) and (b) show the decomposition from level 1 to 5 for high NG voltage due to harmonics
and transients, respectively.

There are in total 2 inputs and 2 outputs for this LMNN. A total of 38 data points are used for
training and 16 for testing. In the ANN simulation, the best training performance was recorded at epoch 1
with a value of 0.0093067. The regression value (R) is 0.97469, close to 1, which indicates that the LMNN
performs well.

Figure 5 (a) and (b) show the best training performance of the LMNN and the regression of the ANN,
respectively.


[Figure 4 panel titles, (a) and (b): Decomposition at level 5: s = a5 + d5 + d4 + d3 + d2 + d1]
Figure 4. The decomposition from level 1 to 5 for high NG voltage due to (a) harmonic (b) transient


[Figure 5 panel titles and axes: (a) Best Training Performance is 0.0093067 at epoch 1, vertical axis Mean Squared Error (mse); (b) Training: R=0.97469, vertical axis Output ~= 1*Target + 0.014]
Figure 5. Training result (a) the best training performance (b) regression


The results of the prediction and classification of the training and testing data are shown in
Figure 6.


Figure 6. The prediction and classification of (a) training (b) testing

From the testing prediction graph, nearly all data values fall into their respective groups. The
accuracy of the prediction for training and testing was then calculated. Table 2 shows the error and
accuracy of the testing predictions.


Table 2. Testing Prediction Error


Test Predict    Test Output    Error       Accuracy (%)
0.97644         1              -0.02356    97.64
0.99644         1              -0.00356    99.64
0.92663         1              -0.07337    92.66
0.97479         1              -0.02521    97.48
0.99703         1              -0.00297    99.70
0.97550         1              -0.02450    97.55
0.99831         1              -0.00169    99.83
0.99500         1              -0.00500    99.50
0.99560         1              -0.00440    99.56
0.97859         1              -0.02141    97.86
0.98839         1              -0.01161    98.84
0.99253         1              -0.00747    99.25
1.98476         2              -0.01524    99.24
0.97584         1              -0.02416    97.58
0.99969         1              -0.00031    99.97
0.99830         1              -0.00170    99.83


From Table 2, 15 data points are classified into group 1 and only 1 data point into group 2.
The table shows that the accuracy of the LMNN testing prediction is above 90% for all data. The mean
square error obtained from the testing process is 0.000544. This indicates good performance of the LMNN
model, which makes a perfect classification for the two desired outputs.
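
The reported testing mse can be checked directly from the Error column of Table 2. In the sketch below, the accuracy formula (one minus the absolute error relative to the target, as a percentage) is inferred from the tabulated values and is not stated in the paper. The ninth error is taken as -0.00440, the value implied by its prediction of 0.99560.

```python
# Error column of Table 2 (prediction minus target), in table order.
errors = [
    -0.02356, -0.00356, -0.07337, -0.02521,
    -0.00297, -0.02450, -0.00169, -0.00500,
    -0.00440, -0.02141, -0.01161, -0.00747,
    -0.01524, -0.02416, -0.00031, -0.00170,
]
# Test Output column: every sample is group 1 except the thirteenth (group 2).
targets = [1] * 12 + [2] + [1] * 3

# Mean square error over the 16 testing samples.
mse = sum(e * e for e in errors) / len(errors)

# Per-sample accuracy in percent (formula inferred from the tabulated values).
accuracies = [(1 - abs(e) / t) * 100 for e, t in zip(errors, targets)]
lowest_accuracy = min(accuracies)
```

The computed mse rounds to 0.000544, which agrees with the reported value, and the lowest per-sample accuracy stays above 90%.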


4. CONCLUSION 

This paper has presented the classification of high NG voltage due to harmonics and transients using
an ANN. An LMNN classifier has been successfully developed to classify NG voltage accurately. The results
show that the accuracy of the testing prediction is 90% and above. Further, the mean square error obtained
from the testing process is small, at 0.000544. The accuracy and mean square error are acceptable for
classifying neutral-to-ground voltage components. Thus, it is concluded that the LMNN classifier can
classify the types of NG voltage with high accuracy.